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Abstract: In this short note, we show how to use concentration inequalities in order to build exact 
confidence intervals for the Hurst parameter associated with a one-dimensional fractional Brownian 
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1 Introduction 

Let B = {B t : t > 0} be a fractional Brownian motion with Hurst index H £ (0, 1). Recall that this 
means that B is a real-valued continuous centered Gaussian process, with covariance given by 

E{B t B s ) = \{s 2H +t 2H -\t-s\ 2H ). 

The reader is referred e.g. to |12j for a comprehensive introduction to fractional Brownian motion. 
We suppose that H is unknown and verifies H < H* < 1, with H* known (throughout the paper, 
this is the only assumption we will make on H). Also, for a fixed n > 1, we assume that one observes 
B at the times belonging to the set {k/n; k = 0, . . . , n + 1}. 

The aim of this note is to exploit the concentration inequality proved in [10] , in order to derive an 
exact (i.e., non-asymptotic) confidence interval for H. Our formulae hinge on the class of statistics 

n-l 

S n = {Bh+2 - 2B h±1 + B*) 2 , n>l. (1.1) 

k=Q 

We recall that, as n — + oo and for every H 6 (0, 1), 

n 2 "- 1 S n -» 4 - 4 H , a.s.-P, (1.2) 

(see e.g. [5]), and also 

Z n = n 2H -^S n - V^(4-4 H ) (1.3) 

' 2jr (B»±a - 25*+! + B,) 2 - (4 - 4 ff )) N(0, c H ), (1.4) 
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where N(Q, ch) indicates a centered normal random variable, with finite variance cjj > depending 
only on H (the exact expression of ch is not important for our discussion) . We stress that the CLT 
(|1.4p holds for every H £ (0, 1): this result should be contrasted with the asymptotic behavior of 
other remarkable statistics associated with the paths of B (see e.g. [3] and [1]), whose asymptotic 
normality may indeed depend on H. The fact that Z n verifies a CLT for every H is crucial in order to 
determine the asymptotic properties of our confidence intervals: see Remark 13.31 for further details. 

The problem of estimating the self-similarity indices, associated with Gaussian and non-Gaussian 
stochastic processes, is crucial in applications, ranging from time-series, to physics and mathematical 
finance (see e.g. [IT] for a survey). This issue has generated a vast literature: see [TJ and [5] for some 
classic references, as well as [5], [7J, [E], [IS], and the references therein, for more recent discussions. 
However, the results obtained in our paper seems to be the first non-asymptotic construction of 
a confidence interval for the Hurst parameter H. Observe that the knowledge of explicit non- 
asymptotic confidence intervals may be of great practical value, for instance in order to evaluate 
the accuracy of a given estimation of H when only a fixed number of observations is available. In 
order to illustrate the novelty of our approach (i.e., replacing CLTs with concentration inequalities 
in the obte! ntion of confidence intervals), we also decided to keep things as simple as possible. In 
particular, we defer to a separate study the discussion of further technical points, such as e.g. the 
optimization of the constants appearing in our proofs. 

The rest of this short note is organized as follows. In Section[2]we state a concentration inequality 
that is useful for the discussion to follow. In Section [3] we state and prove our main result. 



2 A concentration inequality for quadratic forms 

Consider a finite centered Gaussian family X = {Xk : k = 0, M}, and write R(k, I) = E(XkXi). In 
what follows, we shall consider two quadratic forms associated with X and with some real coefficient 
c. The first is obtained by summing up the squares of the elements of X, and by subtracting the 
corresponding variances: 

M 

Q 1 (c,X)=cJ2(^-R(k,k)); (2.1) 

k=0 

the second quadratic form is 

M 

Q 2 {c, X) = 2c 2 J2 x kXiR{k, I). (2.2) 

fc,2=0 

Note that Q2(c,X) > 0. It is well known that, if Qi(c,X) is not a.s. zero, then the law of Qi(c, X) 
admits a density with respect to the Lebesgue measure (this claim can be easily proved by observing 
that Qi(c, X) can always be represented as a linear combination of independent centered % 2 random 
variables - see |14) for a general reference on similar results). The following statement, whose 
proof relies on the Malliavin calculus techniques developed in [TU] , characterizes the tail behavior of 

Qi(c,x). 

Theorem 2.1. Let the above assumptions prevail, suppose that Q\(c,X) is not a.s. zero and fix 
a > and (3 > 0. Assume that Q2(c, X) < aQi(c, X) + f3, a.s. -P. Then, for all z > 0, we have 

z 2 \ . , „, , / z 1 



P(Qi(c, X) > z) < cxp \- 2az + 2f3 J and P(Qi(c, X) < -z) < exp \- ^, 



In particular, P(\Qi(c, X)\ > z) < 2 exp ^~ 



2az+2(3 
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Proof. In this proof, we freely use the language of isonormal Gaussian processes and Malliavin 
calculus; the reader is referred to [TTJ Chapter 1] for any unexplained notion or result. Without loss of 
generality, we can assume that the Gaussian random variables X k have the form X k = X(h k ), where 
X(Sj) — {X(h) : h E Sj} is an isonormal Gaussian process over f) = M. M , and {h k : k = 1, M} is a 
finite subset of 9) verifying 

E[X{h k )X(hi)]=R{k,l) = (h k ,h l )x. 

It follows that Qi(c,X) = ^(c^fcio h k ® hk), where I2 stands for a double Wiener-Ito stochastic 
integral with respect to X, so that the Sj- valued Malliavin derivative of Qi(c, X) is given by 

M 

DQ 1 {c,X) = 2cY / X(h k )h k . 

k=0 

Now write L _1 for the pseudo-inverse of the Ornstein-Uhlenbeck generator associated with X(Sj). 
Since Qi(c, X) is an element of the second Wiener chaos of X($)), one has that L~ 1 Qi(c, X) = 
— \ Qi(c, X). One therefore infers the relation 

(DQ 1 (c 1 X),~DL- 1 Q 1 (c,X))^ = ^\\DQ 1 (c,X)\\l = Q 2 (c,X). 

The conclusion is now obtained by using the following general result. □ 

Theorem 2.2. (See \10\ Theorem 4.1]). Let X(Sj) — {X(h) : h E Sj} be an isonormal Gaussian 
process over some real separable Hilbert space f). Write D (resp. L^ 1 ) to indicate the Malliavin 
derivative (resp. the pseudo-inverse of the generator L of the Ornstein-Uhlenbeck semigroup). Let Z 
be a centered element o/D 1,2 := donxD, and suppose moreover that the law of Z has a density with 
respect to the Lebesgue measure. If, for some a > and ft > 0, we have 

(DZ, -DL- l Z)f, <aZ + f3, a.s.-P, 

then, for all z > 0, we have 

P(Z > z) < exp ( \ and P(Z < -z) < exp ( -— \ . 

Remark 2.3. One of the advantages of the concentration inequality stated in Theorem 12.11 (with 
respect to other estimates that could be obtained by using the general inequalities by Borell [2]) is 
that they only involve explicit constants. 



3 Main result 

We go back to the assumptions and notation detailed in the Introduction. In particular, B is a 
fractional Brownian motion with unknown Hurst parameter H E (0,H*], with H* < 1 known. The 
following result is the main finding of the present note. 

Theorem 3.1. Fix n > 1, define S n as in II 1.1)) and fix a real a such that < a < (4 — 4 H ) v / n. 
For x E (0, 1), set g n (x) = x — lo ^~^ ^ . Then, with probability at least 



<f{a) 



1 — 2 exp 



(3.1) 
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(where [■]+ stands for the positive part function) , the unknown quantity g n (H) belongs to the following 
confidence interval: 



I(n) = [I l (n),I r (n)} 



1 \0gS n , l0 § (l - (4-4**)^ ) . 1 logS„ , (l 



(4-4"*)v^ 



2 2 log n 2 log n 2 2 log n 2 log n 



Remark 3.2. 1. We have that linin^oo g n {H) = H. Moreover, it is easily seen that the asymp- 
totic relation (|1.2[l implies that, a.s.-P, 

lim Mn) = lim I r {n) = H, (3.2) 

n — >oo n — >oo 

that is, as n — > oo, the confidence interval /(n) "collapses" to the one-point set {H}. 

2. In order to deduce (from Theorem 13. lj) a genuine confidence interval for H, it is sufficient to 
(numerically) inverse the function g n . This is possible, since one has that g' n (x) > I for every 
a; G (0,1), thus yielding that g n is a continuous and strictly increasing bijection from (0,1) 
onto (— log 3/ (2 logn), +oo). It follows from Theorem 13.11 that . with probability at least ip(a), 
the parameter H belongs to the interval 

J(n) = [J,(n), J r (n)} = [s" 1 ^)); s" 1 ^))] , 

where u(n) — max{/;(n); — log 3/(2 logn)}. Observe that, since relation (13. 2p is verified, one 
has that h{n) > — log 3/(2 logn), a.s.-P, for n sufficiently large. Moreover, since g^ 1 is 1- 
Lipschitz, we infer that 

J r (n) - Ji (n) < I r (n) - I t (n) = — ?— log ^ ' 4 4 " K ' " + " 



21ogn 6 A H *)^fn~- a 

so that, for every fixed a, the length of the confidence interval J(n) converges a.s. to zero, as 
n — > oo, at the rate 0(l/( v / nlogn)). 

3. We now describe how to concretely build a confidence interval by means of Theorem 13.11 
Start by fixing the error probability e (for instance, e — 0,05 or 0,01). One has therefore two 
possible situations: 

(i) If there are no restrictions on n (that is, if the number of observations can be indefinitely 
increased), select first a > in such a way that 



^{-nWTY))-2 (3 - 3) 

(ensuring that f(a) > 1 — e). Then, choose n large enough in order to have 

< 1 and log )- ' V < L, 



(4-4 H *)V^ 21ogn 6 \(4:~ 4 H *)y/E- a y 

where L is some fixed (desired) upper bound for the length of the confidence interval, 
(ii) If n is fixed, then one has to select a > such that 



,2 



exp t r < - and a < (4 - 4 

v 7i (^+ 3 );" 2 

If such an a exists (that is, if n is large enough), one obtains a confidence interval for H of 
length less or equal to 2^ log ( {jiff jffi^ )- 
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4. The fact that we work in a non-asymptotic framework is reflected by the necessity of choosing 
values of a in such a way that the relation f|3. 3|) is verified. On the other hand, if one uses 
directly the CLT (|1.4[) (thus replacing Z n with a suitable Gaussian random variable) , then one 
can define an asymptotic confidence interval by selecting a value of a such that a condition of 
the type 

exp(— est x a 2 ) < e 

is verified. 

5. By a careful inspection of the proof of Theorem 13.11 we see that the existence of H* is not 
required if we are only interested in testing H < H for a given H . 

Proof of Theorem 13.11 Define X n = {X n ^ : k = 0, n — 1}, where 

X n = B k + 2 — 2B fe + l -\- B k_ . 

By setting 

PH {r) = ~( - \r - 2\ 2H + 4|r - 1\ 2H - 6\r\ 2H + 4|r + l\ 2H - \r + 2\ 2H ) 7 r e Z, 

one can prove by standard computations that the covariance structure of the Gaussian family X n is 
described by the relation E(X n ^X n: i) = pn{k — l)/n 2H . Now let Z n be defined as in (|1.3[) : it easily 
seen that Z n = Qi(n 2ff_1 / 2 , X n ) (as defined in (|2.ip ). We also have, see formula ()2.2|) : 

Q 2 (n 2H -^ 2 ,X n ) = 2n iH -^X ntk Xn / H{ ^~ l) 

k,l=0 
n-1 

< 2n 2H - x \X nik \\X nil \\p H {k-l)\ 

k.l=Q 

n-1 

k,l=Q 

71—1 71—1 

= 2n 2H - x £ Xl k \p H {k-l)\<2n 2H -^Xl k Y,\PH(r)\ 

k,l=0 k=0 rSZ 

< \J2\p H (r)\ j (^» + 3VH) =a n Z n + (3 (3.4) 



with 



a n = —S^\ PH { r )\ and = 6 V" \Ph{t)\- (3.5) 
Since Z n ^ 0, Theorem 12.11 applies . yielding 

P(W> ° )S "-(- «I^Jr)l(jS, + 3) )- <3 ' 6) 
Now, let us find bounds on X)rez l/°-tf( r )l that are independent of ii" . Fix r > 3. Using 

v-^ a(o! — 1) . . . (a — k + 1) i 

1 + m q = 1 + V — ^—7 — -7/ for0<u<l. 

z — ' k\ 

k=l 
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we can write 



= ^ £ = ^ i2H - k + 1 \- (-2)* + 4(-l)* + 4 - 2*)r-* 

fe=i 



/=i 



2« V 2g(2g-l)---(2g-2i + l) _ h „_ 2i 
^ (20! 1 



Note that the sign of 2H(2H - 1) • • ■ (2H - 21 + 1) is the same as that of 2H - 1 and 

\2H(2H -l)---(2H-2l + l)\ = 2H\2E - l|(2 - 2H) ■ ■ ■ (21 - 1 - 2H) 

< 2(2/ -1)!. 

Hence, we can write 



+2° A l 

\Ph{t)\ < r 2 "£ 



r - l 



I 

1=1 



4r 2H log ( 1 - — ) - r 2H log ( 1 - -1 ) (since log(l - ti) = - £~ t x if < u < l) 
243 



< ^ r 2H ~ 4 (since 4 log(l - u) - log(l - 4it) < ^u 2 if < u < §) 
243 



< — r- 2 . 
- 20 

Consequently, taking into account of the fact that pu is an even function, we get 

oo 

£|M0I < |p ff (0)| + 2| PH (l)|+2|pH(2)| + 2ElmM| 

oo 

= |4 — 4 ff | + |4 x A H — 9 H — 7| + |4 — 6 x 4 H + 4 x 9 H — 16 H \ + 2^\p H (r)\ 



r=3 



< 3 + 4 + 1 + if fy- 1 ^) l7 ' r,,) -- l7 ' 7r *' 



Putting this bound in (|3.6p yields 

P(\Z n \ >a)< 2exp (- —^-^ j . (3.7) 

Note that the interest of this new bound is that the unknown parameter H does not appear in the 
right-hand side. Now we can construct the announced confidence interval for g n (H). First, observe 
that Z n — n 2H ~z S n — (4 — A H )^/n. Using the assumption H < H* on the one hand, and (|3.7p on 
the other hand, we get: 

1 logS„ , kg (l- (4-4^ ) ^ , m 1 logS„ , l°g(l 



(4-4^* 



2 21ogn 21ogn _a v y_ 2 21ogn 21ogn 
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;)| 1 \0gS n | lp g (* ~ <H l0g(4-4 g ) < 1 logS w | lQ g (l + (4-4* )v /n) 

2 2 log n 2 log n 2 log n — 2 2 log n 2 log n 

p (l logS„ | log((4-4 g )V^-a) 1 logS„ , log((4~4 ff )^+a)^ 



4 2 log n 2 log n 4 2 log n 2 log n / 

= P(\ZJ < a) > 1 - 2exp f r | 

which is the desired result. □ 



Remark 3.3. The fact that Q 2 {n 2H - 1 / 2 , X n ) < a n Z n + (3 (see fO^I). where a n -> and /3 > 0, 

is consistent with the fact that Z n N(0,ch), and Q 2 (n 2H ~ 1 / 2 , X n ) = |||DZ n |||, where £>Z„ 
is the Malliavin derivative of Z„ (see the proof of Theorem 12. ip . Indeed, according to Nualart and 

Ortiz-Latorre [T3j, one has that Z n ^5 N(0,Ch) if and only if |||DZ„||1 converges to the constant 
Ch in L 2 . See also [5] for a proof of this fact based on Stein's method. 
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